Accusoft.OCRXpress.nodejs
User Guide

OCR Xpress™ for Node.js is a full-page OCR engine. Any Bitmap file (BMP) in an uncompressed 1-BPP, 8-BPP, or 24-BPP format can be loaded and processed without any image pre-filtering or pre-processing. OCR Xpress for Node.js converts the input image into a searchable PDF document. One or more images can be built into a single PDF document. OCR Xpress for Node.js also provides a rich API that allows you to access the same internal OCR results used to generate the PDF documents.

In addition to using OCR Xpress for Node.js in an end-to-end product solution for converting full page images into searchable text, there are several other uses in which customers may apply OCR Xpress for Node.js functionality. OCR Xpress for Node.js can also convert an image to a TXT file for archiving searchable text. By archiving the original image with the searchable text file in a database, it can later be retrieved according to the results of searches for key words or phrases in the text file.

For applications that need to access post-OCR data for processing purposes, OCR Xpress for Node.js generates and maintains an internal hierarchical model of the text it finds in an image. Every character is hierarchically tied to the word, text line, text block, region, and page with which it is associated. The same is true of every word, text line, text block, region, and page of the generated document. The rich API allows the application to access this internal hierarchical model or to directly access items in the hierarchy. With OCR Xpress for Node.js, a form reader application can extract data from the form based on its location. The API also provides confidence levels of the text in question so that the application can make content usage decisions based on the confidence that the recognized text is correct.

The OCR Xpress for Node.js toolkit provides the following functionality:

Limitation

Even though OCR Xpress for Node.js is designed to be a stand-alone OCR engine, there are some pre-OCR image processing operations that may need to be performed on the input image to achieve optimal results. For example, OCR Xpress for Node.js does not deskew the input image.

 

For information on how to register and license all your Accusoft components, see Licensing.

 

The User Guide provides information on:   

 

 

 


©2016. Accusoft Corporation. All Rights Reserved.

Send Feedback